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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 
(i) APPLICANT: 

(A) NAME: GRIFFITHS, RICHARD 

(B) STREET: 20 (TR) Woodcroft Avenue, Broomhill, 

(C) CITY: Glasgow 

(E) COUNTRY: United Kingdom 

(F) POSTAL CODE (ZIP): Gil 7HX 

(A) NAME: TIWARI, BELA 

(B) STREET: 4 Upway Road 

(C) CITY: Oxford 

(E) COUNTRY: United Kingdom 

(F) POSTAL CODE (ZIP): 0X3 9QH 

(ii) TITLE OF INVENTION: AVIAN GHD GENES AND THEIR USE IN METHODS FOR 
SEX IDENTIFICATION IN BIRDS 

(iii) NUMBER OF SEQUENCES: 39 
(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 



(Vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: WO PCT/GB96/01341 

(B) FILING DATE:05-JUN-1996 
(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: GB 9511439.3 

(B) FILING DATE:06-JUN-1995 
(Vii) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: WENDEROTH , LIND & PONACK, L.L. 

(B) STREET: 2033 K. Street, N.W. , Suite 800, 

(C) CITY: Washington 

(D) STATE: D.C. 
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(F) ZIP: 20006 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Warren M Cheek, Jr. 

(B) REGISTRATION NUMBER: 33,367 

(C) REFERENCE/DOCKET NUMBER: 263/PPNTIR1172US 
(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: ( 202 ) -721-8200 

(B) TELEFAX: (202) -721-8250 

(C) TELEX: 

(2) INFORMATION FOR SEQ ID NO: 1: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 723 base pairs 

(B) TYPE: nucleic acid 
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(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



LvV^Ow X V/UUn 


V7V3 X J- J. ^fXT\w w 


AATGACTAGA 

aa x w<r^\-» x nun 


TGTGGCACTT 

X VJ X vj VJ vxlv X X 


AGTGCCATGG 


TCTAGTTGAC 


60 


rinuu X \jrx X 


TTGGTCAAAA 


GTTGGACTCG 

w X X VJVJnv^ X vV3 


ATGATCTCAG 

XX X VJil X V»» X V«»xxVJ 


AGTTTTTTTC 


CAGCCTTAAT 


120 


AA 1 1 A X VJ A 


ATTPTGTAAT 

a x x v*. x vj x nn x 


XXX -fx X X X X VJ 


ATCTTTTTGA 

J* X V»» X X X X X VJfi 


GCGAAGTTTG 


TTTGGGGATT 


180 


TTAGTTTGGT 


TTCCCTGTCA 


CTGTTTTCTT 


TCCTTGAAAC 


TGACTTTCAT 


TTGCAACATG 


240 


AGAATTGCTG 


TATTTGTCAG 


GTTACAAGTA 


GTGCAATGGC 


TGCTTAGAAG 


TAGTGAGAAA 


300 


CATTTAGGGA 


AATACTGGAG 


TGAAGCAAAC 


ACAGTGGTAC 


TGCCAAACTG 


TAGCTTTGGG 


360 


ATTTGAGGAG 


CCACAGAGTT 


GTATATAAAT 


TTGTTTAATG 


ATATCCTGCC 


CCTGCCTTCC 


420 


ATTAATTGCT 


TGTTTTATGA 


AACCACTCTT 


TTTTTTTTTT 


TTTTTTTTT?T 


GGCTTCTTCA 


480 


TATCCTGTGG 


TAATGAGTTA 


ATGCATTTAG 


AAGCACATGG 


CAGAACTAGG 


AGATCTGTGG 


540 


ATGACAGTGG 


TACAGGAGCT 


CTGAATTTTT 


TAGATAAACT 


ATGAGAGTGG 


AAACAGAAAT 


600 


CTGAGGCTAG 


TTTCTTGAGC 


TGACTGTAAA 


TTTTGTGAGA 


ATATTTTCAA 


GACTACATTA 


660 


GTTGTGTGTT 


TGAGGAAAAA 


TAAAATGTTT 


AAGTTGTCCA 


TTCCTTGAAA 


CCTCCCGACC 


720 


GGG 












723 



(2) INFORMATION FOR SEQ ID NO: 2: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: CDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
ATTCTTCCAG ATGATCCTGA TAAAAAACCA CAAGCAAAAC AGTTACAGAC CAAAAAACCA 60 
CAAGCAAAAC AGTTACAGAC CCGTGCAGAC TACCTCATCA AACTACTTAG CAGAGATCTT 120 
GCAAAAAGAG AGGCTCAGAG ACTTTGTGGT GCG 153 

(2) INFORMATION FOR SEQ ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
ATTTTACCTG ATGATCCAGA CAAGAAACCC CAGGCAAAGC AGCTACAGAC CAAGAAACCC 60 
CAGGCAAAGC AGCTACAGAC CCGTGCAGAC TACCTCATTA AATTACTGAA TAAAGACCTT 120 
GCAAGAAAGG AAGCACAAAG GCTTGCTGGT GCA 153 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
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ATTTTACCTG ATGATCCAGA TAAGAAACCC CAGGCTAAGC AGTTACAGAC CAAGAAACCC 60 

CAGGCTAAGC AGTTACAGAC CCGTGCAGAT TACCTCATTA AATTACTGAA TAAAGACCTT 120 

GCAAGAAAGG AAGCACAGAG ACTTGCTGGT GCA 153 

(2) INFORMATION FOR SEQ ID NO: 5: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
ATTTTACCTG ATGACCCAGA TAAGAAACCA CAGGCAAAGC AGTTGCAGAC CAAGAAACCA 60 
CAGGCAAAGC AGTTGCAGAC CCGTGCAGAT TACCTCATTA AATTACTGAA TAAAGACCTT 120 
GCAAGAAAAG AAGTGCAAAG ACTTACTGGT GCA 153 



(2) INFORMATION FOR SEQ ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

lie Leu Pro Asp Asp Pro Asp Lys Lys Pro Gin Ala Lys Gin Leu Gin 



- 56 - 

15 10 15 

Thr Arg Ala Asp Tyr Leu lie Lys Leu Leu Ser Arg Asp Leu Ala Lys 

20 25 30 

Arg Glu Ala Gin Arg Leu Cys Gly Ala 
35 40 

(2) INFORMATION FOR SEQ ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

lie Leu Pro Asp Asp Pro Asp Lys Lys Pro Gin Ala Lys Gin Leu Gin 

15 10 15 

Thr Arg Ala Asp Tyr Leu lie Lys Leu Leu Asn Lys Asp Leu Ala Arg 

20 25 30 

Lys Glu Ala Gin Arg Leu Ala Gly Ala 
35 40 

(2) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

lie Leu Pro Asp Asp Pro Asp Lys Lys Pro Gin Ala Lys Gin Leu Gin 

15 10 15 

Thr Arg Ala Asp Tyr Leu lie Lys Leu Leu Asn Lys Asp Leu Ala Arg 

20 25 30 

Lys Glu Ala Gin Arg Leu Ala Gly Ala 
35 40 



(2) INFORMATION FOR SEQ ID NO: 9: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
lie Leu Pro Asp Asp Pro Asp Lys Lys Pro Gin Ala Lys Gin Leu Gin 
15 10 15 

Thr Arg Ala Asp Tyr Leu lie Lys Leu Leu Asn Lys Asp Leu Ala Arg 

20 25 30 

Lys Glu Val Gin Arg Leu Thr Gly Ala 
35 40 
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(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6608 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CGGGCTGCGG CACGAAGCGC ACCGCCGGCG CACGCAGGCT CGGGCCGGGG AAGGCCTGGC 60 
CCGCCGAGCC GGACGCACGC AGGTATTTGG GCAAAAATCT TGGCCATCTG TAGAGAATAG 120 
CAAGT CAAAC GCATTACTTC GAAAACATAC GGAGTACCAG AAAGGGGATT CTTGACCTAC 180 
ACCTTGTAAC CTGAGTGGAC TTTCTTTTTA ACTTCTTAAT ACTTACAATG AATGGGCACA 240 
GTGATGAAGA AAGTGTAAGA AACAGCAGTG GAGAGTCAAG CAGATCAGAT GATGATTCTG 300 
GGTCAGCTTC AGGTTCTGGA TCTGGTTCAA GCTCTGGAAG CAGTAGCGAT GGAAGTAGCA 360 
GCCAGTCAGG TAGCAGTGAC TCTGAATCTG GTTCAGAGTC AGGCAGTCAA TCCGAATCAG 420 
AGTCTGACAC ATCTAGAGAG AAGAAACAAG TTCAAGCTAA ACCTCCGAAA GCTGACGGAT 480 
CTGAGTTTTG GAAGTCCAGT CCAAGCATAC TTGCTGTACA GAGATCAGCA GTG CTCAAGA 540 
AGCAACAGCA ACAGCAAAAA GCAGCATCAT CAGACAGTGG TTCAGAAGAG GACTCATCCA 600 
GTAGTGAAGA TTCTGCCGAT GATTCGTCCA GTGAAACTAA GAAGAAAAAG CATAAAGATG 660 
AAGACTGGCA AATGTCAGGG TCAGGGTCAG TATCAGGAAC TGGTTCTGAT TCTGAATCGG 720 
CGGAAGATGG GGATAAAAGC AGTTGTGAAG AAAGTGAATC TGACTATGAG CCAAAAAACA 780 
AAGTCAAAAG CCGTAAACCT CCAAGCAGAA TTAAGCCAAA AAGTGGGAAA AAGAGCACAG 840 
GACAGAAGAA GAGGCAACTT GATTCATCAG AGGAGGAGGA GGACGATGAT GAAGATTATG 900 
ATAAGAGAGG ATCTCGTCGC CAGGCAACAG TGAATGTTAG TTACAAAGAA GCTGAAGAAA 960 
CCAAGACAGA TTCTGATGAT TTGCTGGAAG TTTGTGGAGA GGATGTCCCA CAGACTGAAG 1020 
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AAGATGAATT TGAAACTATA GAGAAGTTTA TGGACAGTCG AATTGGCCGA AAAGG AG CCA 1080 

CTGGTGCCTC AACCACCATC TATGCCGTTG AGGCAGATGG TGACCCAAAT GCTGGGTTTG 1140 

AAAAGTCAAA GGAGCTGGGA GAAATACAGT ATCTTATTAA ATGGAAAGGC TGGTCACACA 1200 

TCCATAACAC TTGGGAAACT GAAGAAACGC TGAAGCAACA AAATGTTAAA GGAATGAACA 1260 

AACTGGACAA CTACAAGAAA AAGGATCAGG AGACAAAACG CTGGCTGAAA AATGCTTCTC 1320 

CAGAAGATGT GGAATATTAT AACTGCCAGC AGGAGCTTAC AGATGATCTG CACAAACAAT 1380 

ATCAAATAGT GGAAAGAATA ATTGCTCATT CAAATCAAAA GTCAGCAGCT GGTTATCCGG 1440 

ACTACTATTG CAAATGGCAG GGTCTGCCTT ACTCAGAATG TAGCTGGGAA GATGGTGCTC 1500 

TCATTGCCAA AAAGTTTCAG GCACGCATTG ATGAGTATTT TAGCAGAAAT CAATCCAAGA 1560 

CTACTCCCTT TAAGGACTGC AAGGTTCTAA AACAGAGACC AAGATTTGTT GCACTGAAGA 1620 

AGCAACCATC TTACATTGGA GGACATGAAA GTCTGGAGTT AAGAGATTAT CAGTTAAATG 1680 

GATTGAATTG GCTCGCTCAT TCATGGTGCA AAGGAAATAG TTGTATTCTT GCAGATGAAA 1740 

TGGGTCTGGG TAAAACAATA CAAACAATTT CTTTTCTGAA CTACCTGTTT CATGAACATC 1800 

AACTGTATGG CCCTTTTCTT CTGCGCGTGC CACTTTCTAC CTTGACATCT TGGCAAAGAG 1860 

AGATTCAAAC TTGGGCTCCT CAGATGAATG CTGTAGTTTA CTTAGGAGAT ATAACTAGTA 1920 

GAAATATGAT AAGGACTCAT GAATGGATGC ATCCACAGAC TAAACGATTA AAGTTTAACA 1980 

TACTTCTGAC GACATATGAA ATTTTACTGA AGGATAAGTC ATTCCTTGGT GGTCTCAATT 2040 

GGGCATTCAT AGGAGTTGAT GAAGCTCATC GTTTAAAAAA TGATGACTCT CTTCTGTACA 2100 

GGACTTTAAT AGACTTTAAG TCCAACCATC GACTTCTGAT TACTGGAACC CCACTGCAAA 2160 

ATTCCCTCAA AGAGCTGTGG TCTTTGTTGC ATTTCATCAT GCCAGAAAAA TTTTCCTCCT 2220 

GGGAAGATTT TGAAGAGGAG CATGGCAAAG GAAGAGAGTA TGGTTATGCA AGTCTTCACA 2280 

AAGAGCTTGA ACCATTTTTA CTAAGAAGAG TTAAAAAAGA TGTAGAAAAG TCTTTACCTG 2340 

CTAAGGTTGA ACAAATTCTG AGGATGGAAA TGAGTG CATT GCAGAAGCAA TATTACAAGT 2400 

GGATTTTAAC AAGGAATTAT AAAGCCCTCA GTAAAGGTTC AAAAGGCAGT ACCTCAGGCT 2460 

TTCTGAACAT TATGATGGAA CTTAAGAAGT GTTGTAACCA TTGCTACCTC ATTAAGCCAC 2520 
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CAGATGATAA TGAATTCTAT AATAAACAGG AGGCCTTACA GCATTTGATA CGTAGCAGCG 2580 

GGAAACTAAT CCTTCTTGAC AAGCTACTGA TTCGTCTGCG AGAACGTGGC AACAGAGTTC 2640 

TGATTTTCTC TCAGATGGTG AGGATGCTGG ACATCCTAGC AGAATATCTG AAGTATCGCC 2700 

AGTTTCCCTT CCAGAGACTT GATGGATCAA TAAAAGGGGA ATTGAGGAAG CAAGCACTGG 2760 

ATCATTTCAA TGCAGAAGGA TCAGAGGATT TCTGTTTTTT ACTGTCTACA AGAGCTGGAG 2820 

GATTAGGTAT TAACTTGGCA TCTGCTGACA CTGTAGTTAT TTTTGATTCT GACTGGAATC 2880 

CACAGAATGA TCTGCAGGCA CAGGCGAGAG CTCATAGAAT TGGACAGAAG AAACAGGTTA 2940 

ATATTTATCG GCTAGTCACA AAAGGATCAG TAGAAGAAGA TATTCTTGAA AGAGCCAAGA 3000 

AGAAGATGGT GCTAGACCAT TTAGTAATTC AGAGAATGGA CACGACAGGA AAAACTGTTC 3060 

TGCATACAGG TTCAACTCCA TCAAGCTCTA CACCTTTTAA TAAAGAAGAG TTATCAGCTA 3120 

TTTTGAAGTT TGGTGCTGAG GAACTCTTTA AAGAACCTGA AGGAGAAGAA CAGGAGCCCC 3180 

AGGAAATGGA TATAGATGAA ATCTTGAAGA GAGCTGAAAC TCGGGAAAAT GAGCCAGGTC 3240 

CATTGACTGT AGGGGATGAG TTGCTTTCAC AGTTCAAGGT GGCGAACTTT TCCAATATGG 3300 

ATGAAGATGA TATTGAGTTG GAACCAGAAA GAAATTCAAG AAATTGGGAA GAAATCATCC 33 60 

CAGAATCCCA ACGGAGAAGG ATAGAGGAGG AGGAAAGACA AAAAGAACTT GAAGAAATAT 3420 

ACATGCTCCC GAGGATGAGA AACTGTGCAA AACAGATCAG CTTTAATGGG AGTGAAGGAA 3480 

GACGCAGTAG GAGCAGAAGA TATTCTGGAT CTGATAGTGA CTCCATCACA GAAAGAAAAC 3540 

GGCCAAAAAA GCGTGGAAGA CCTCGAACCA TTCCTCGAGA AAATATTAAA GGATTTAGTG 3600 

ATGCAGAGAT CAGGCGGTTT ATCAAGAGTT ACAAGAAATT TGGTGGCCCT CTGGAAAGGT 3660 

TAGATGCTGT AGCTAGAGAT GCTGAACTGG TTGATAAATC TGAGACAGAC CTTAGACGTT 3720 

TGGGTGAACT TGTACATAAT GGATGCATTA AGGCTTTAAA GGACAATTCA TCTGGACAAG 3780 

AAAGAGCAGG AGGTAGACTT GGGAAAGTTA AAGGCCCAAC GTTTCGAATC TCAGGAGTGC 3840 

AGGTGAATGC AAAACTAGTC ATCTCTCACG AAGAAGAGCT GGCACCACTG CACAAATCCA 3900 

TTCCTTCAGA TCCAGAAGAA AGGAAAAGAT ATGTCATCCC ATGCCACACC AAGGCTGCTC 3960 

ACTTCGATAT AGATTGGGGT AAAGAAGATG ATTCCAATCT GTTAGTAGGC ATCTATGAAT 4020 
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ATGGCTATGG CAGCTGGGAA ATGATAAAAA TGGATCCAGA TCTCAGCTTA ACACAGAAGA 4080 

TTTTACCTGA TGATCCAGAC AAGAAACCCC AGGCAAAGCA GCTACAGACC CGTGCAGACT 4140 

ACCTCATTAA ATTACTGAAT AAAGACCTTG CAAGAAAGGA AGCACAAAGG CTTGCTGGTG 4200 

CAGGCAATTC CAAGAGAAGG AAGACAAGAA ATAAGAAGAA TAAGATGAAG GCTTCAAAAA 4260 

TAAAAGAAGA AATAAAGAGT GATTCTTCAC CACAACCCTC AGAAAAATCT GATGAAGATG 4320 

ATGAGGAGGA GGATAACAAG GTAAATGAAA TGAAATCTGA AAATAAAGAA AAATCTAAAA 4380 

AAATTCCATT GCTGGATACT CCAGTTCATA TTACTGCAAC CAGTGAACCA GTTCCTATCT 4440 

CAGAAGAATC TGAAGAACTC CATCAGAAGA CATTTAGTGT GTGCAAAGAA AGAATGAGGC 4500 

CTGTCAAAGC AGCACTGAAA CAGCTGGATA GACCAGAGAA GGGCCTTTCT GAAAGGGAGC 4560 

AGCTGGAACA TACTAGGCAG TGTCTAATCA AAATTGGGGA TCACATTACA GAATGCCTGA 4620 

AGGAGTACAC AAATCCCGAG CAAATAAAAC AGTGGAGGAA AAATTTGTGG ATTTTTGTGT 4680 

CCAAGTTTAC AGAATTTGAT GCCAGAAAGC TGCACAAACT CTACAAACAT GCAATCAAAA 4740 

AGCGCCAAGA GTCTCAGCAA CACAATGACC AAAACATTAG CAGCAATGTG AATACACATG 4800 

TAATCAGAAA TCCAGATGTG GAAAGACTGA AGGAGACTAC AAACCATGAT GATAGTAGCA 4860 

GGGACAGTTA TTCTTCTGAT AGACATTTAT CACAATACCA TGATCATCAC AAAGACAGGC 4920 

ATCAGGGAGA TGCTTACAAG AAAAGTGACT CCAGGAAAAG GCCATATTCA GCCTTCAGTA 4980 

ATGGAAAAGA TCACAGAGAC TGGGATCACT ACAAACAGGA CAGCAGATAC TACAGTGATA 5040 

GTAAACATAG AAAGTTAGAT GACCACAGGA GCAGAGACCA CAGGTCAAAC CTGGAAGGAA 5100 

ACTTAAAAGA CAGCCGGGGT CATTCAGATC ACCGCTCCCA TTCAGACCAC AGGATACACT 5160 

CAGATCACCG TTCCACTTCA GAATACAGCC ATCATAAATC TTCGAGAGAT TATAGATACC 5220 

ACTCAGACTG GCAAATGGAC CACAGAGCTT CTGGTAGTGG CCCGAGGTCA CCACTAGATC 5280 

AGAGGTCTCC TTATGGTTCA AGATCTCCCC TAGGACACAG ATCTCCATTT GAACACTCAT 5340 

CAGATCACAA AAGTACACCT GAACATACAT GGAGTAGCCG GAAGACATAA CAAAGACTGA 5400 

CATTTTCTGG ACCTTCTTTT TAGCCATATA CAGTAAACTA ACACAGTAAT TGCCTTACAT 5460 

GACTTGAAAG ATATGGACTG GATATTCTAT CAGTAGCAGT ATTGTTACTT CTTTCCAGGA 5520 
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TGCAAGGTCT ATTATCCCAA CAGAAGAAAA ATATTTTTGT ATTTAAAGTT TATGCTGCAC 5580 

TGTGCTGCAA ATGTTGTGGC ACTTTTTTTT TAAGAAATGG AAGATGTTTA CTTTTACAGG 5640 

GACCTCAACA CTGCCCCTTT CAGACTGGAT CTTACTATAA AACTCTTCAT GTCAAAGTGG 5700 

TTCTAGGCTG AACACAGATT AAATTATGTT TGTAAATGAA CACTTAAACA CTGACCTGTG 5760 

CTTATGTTTC AGGAAAGAAT GGGGGATTTA TTTTGTTTTA TTTCTTGGTA GAGAACTCTC 5820 

AAGGACTTTG TTCACTTTCC AAAGCTACTT GTTTACATTG TACACTGCGA CCACCTTGCC 5880 

GCTTTTCATC ACAAGCTTGA ATATTTAAAT TCTGTACCTA CAGTTGTAAA ATAGCCAGGA 5940 

TTTCTCCTGT TTGTGATCAG TTATAATGCC TTTTTATGAA ACAAACAAAC AAACAAAAAA 6000 

CAATTAAAAA AAAAAACACA ACAAAACCAA CAAATGGCTG TAAATTATTG TAAATTAATT 6060 

AAATGAGCTT TTTTCCGTCA GGCTTTTTTT GGCTGTTCCT TTCCCCAACA ACTCAGGCCT 6120 

TCTTTTCACA AAGTCAGTAT ACTTACATGT TTTAATAAAA TATCTCGATG GAATCAGAAT 6180 

GTAAAAATGG GGAAGGGAAT ATTTTATTCC ATTTAGTGCT CCTTTTTTAT TGGATACTTT 6240 

TACATACCTG TTTTTGGTTG TTTTATTTTA TTTTTTTTTT CTATTAAACT GTCAGTGTTG 63 00 

TGATTGTTGT AATGAACAGT GAGAATATCC CACTCTAAAC TGTGCCCTGG AAAGCTTTTC 63 60 

AGGTGCATTG GTTTAAAAGA AGGAAGTGTT CTATAGGTGA ACACTTCAAA ACCCAGATCA 6420 

GCCAAGATTC ATTGTAAATC CATTTGTTTT CCCTCTTTAA CATGGGCAAT AATGTCAAAT 6480 

GTGCTATGCA GCAGTTAATA TTTTAGAAGA TTTGAATGAC TTTATTAACA GAATTGTTAC 6540 

AATGCACACT GATTGTACAT AGATAACTTC TATCTGACAA ATTAAATTAA CTAAAACCAA 6600 

AAAAAACC 6608 



(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Asp Glu lie Val Ser Val Lys His Leu His Lys Lys lie Lys Thr Glu 

15 10 15 

Lys Glu Asn Glu Glu Lys Pro Glu Pro Asp lie Gly lie Lys Lys Glu 

20 25 30 

Ala Glu Glu Lys Arg Glu Thr Lys Glu Lys Glu Asn Lys Arg Glu Leu 

35 40 45 

Lys Arg Glu Lys Lys Glu Lys Glu Asp Lys Lys Glu Leu Lys Glu Lys 

50 55 60 

Asp Asn Lys Glu Lys Arg Glu Asn Lys Val Lys Glu Ser Thr Gin Lys 
65 70 75 80 

Glu Lys Glu Val Lys Glu Glu Lys 

85 



(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 265 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GATGAGATTG TTTCAGTGAA ACATCTACAT AAAAAAATAA AAACAGAAAA AAGAAAATGA 
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AGAAAAGCCT GAGCCAGATA TTGGTATAAA GAAGGAAGCT GAAGAAAAAA GAGAGACAAA 



120 



AGAGAAGGAA AATAAAAGGG AATTGAAAAG GGAGAAAAAA GAAAAAGAGG ATAAGAAAGA 



180 



ATTAAAAGAA AAAGATAATA AAGAAAAGAG AGAAAACAAA GTAAAAGAAT CCACACAGAA 



240 



AGAAAAAGAA GTGAAGGAAG AGAAG 



265 



(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GATGGGATTG TTTCAGTGAA ACATCCACAT AAAAAAATAA AAGCAGAAAA AAGAAAATGA 60 
AGAAAAAGAT GAGCCAGAGA TTGGTATAAA GAAGGAAGCT GGAGAAAAAA GAGAGACAAA 120 
AGAAAAGGAA AATAAGA 137 

(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
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Asp Gly lie Val Ser Val Lys His Pro His Lys Lys lie Lys Ala Glu 

15 10 15 

Lys Glu Asn Glu Glu Lys Asp Glu Pro Glu lie Gly lie Lys Lys Glu 

20 25 30 

Ala Gly Glu Lys Arg Glu Thr Lys Glu Lys Glu Asn Lys 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1316 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
ATTTATCGGC TAGTCACAAA AGGATCAGTA GAAGAAGATA TTCTTGAAAG AGCCAAGAAA 60 
AAGATGGTGT TAGATCATTT AGTGATTCAG AGAATGGACA CCACAGGGAA AACTGTACTA 120 
CATACAGGCT CTACTCCTTC AAGCTCAACA CCTTTTAATA AGGAAGAGTT ATCAGCAATT 180 
TTGAAGTTTG GTGCTGAGGA ACTTTTTAAA GAACCTGAAN NNGAAGAAGA GGAGCCTCAG 240 
GAGATGGATA TAGATGAAAT CCTGAAGAGG NCTGAAACTC GAGAAAATGA GTCAGGCCCA 300 
TTAACTGTAG GAGATGAGTT ACTTTCACAG TTCAAGGTAG CTAACTTTTC CAATATGGAT 360 
GAAGATGACA TTGAATTGGA ACCAGAACAA AATCTAAGAA ACTGGGAAGA AATCATTCCA 420 
GAAGTTCAGT GGCGACGAAT AGAGGGGNNG GAAAGACAAA AAGAACTTGA AGAAATATAT 480 
ATGCTTCCAA GAATGAGAAA CTGTGCAAAA CAGATCAGCT TTAATGGAAA TGAAGGGAGA 540 
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TGCAGTAGGA GCAGAAGATA TTCTGGATCT GATAGTGATT CCATCTCAGA AAGAAAACGA 600 

CCAAAAAAAC GTGGACGACC ACGAACTATT CCCCGTGAAA ACATTAAAGG ATTTAGTGAT 660 

GCAGAGATTA GACGATTTAT CAAGAGTTAC AAGAAATTTG GTGGCCCAGT TGAAAGGTTA 720 

GATGCTATAG CTAGAGATGC TGAGCTAGTT GATAAATCTG AAACAGACCT TAGACGTCTG 780 

GGAGAACTTG TACATAATGG ATGCATTAAG GCTTTAAATG ATAATGACTT TGGTCAAGGA 840 

AGAACAGGTG GTAGATTTGG GAAAGTTAAA GGCCCAACAT TCCGAATAGC AGGAGTGCAG 900 

GTGAATGCAA AGCTAGTCAT TTCTCACGAA GAAGAGTTGG CACCATTGCA TAAATCGATT 960 

CCTTCAGATC CAGAAGAAAG GAAAAGATAT GTCATCCCAT ACCACACCAA AGCAGCTCAT 1020 

TTTGATATAG ATTGGGGTAA AGAAGATGAT TCCAATCTGT TAATAGGCAT CTATGAATAT 1080 

GGTTATGGCA GTTGGGAAAT GATAAAAATG GATCCTGATC TCAGTTTGAC ACAGAAGATT 1140 

TTACCTGATG ATCCAGATAA GAAACCCCAG GCTAAGCAGT T AC AG ACT CG TGCAGATTAC 1200 

CTCATTAAAT TACTGAATAA AGACCTTGCA AGAAAGGAAG CACAGAGACT TGCTGGTGCA 1260 

GGCAATTCAA AGAGGAGAAA AACAAGAAGT AAGAAGAATA AAGCAACAAA GGCTGC 1316 



(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2108 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Asp Ala Arg Arg Tyr Leu Gly Lys Asn Leu Gly His Leu Arg lie Ala 

15 10 15 

Ser Gin Thr His Tyr Phe Glu Asn lie Arg Ser Thr Arg Lys Gly lie 



20 

Leu Asp Leu His 
35 

Leu Thr Met Asn 
50 

Gly Glu Ser Ser 
65 

Ser Gly Ser Gly 

Gin Ser Gly Ser 
100 

Ser Glu Ser Glu 
115 

Lys Pro Pro Lys 
130 

He Leu Ala Val 
145 

Gin Lys Ala Ala 

Ser Glu Asp Ser 
180 

His Lys Asp Glu 
195 

Thr Gly Ser Asp 
210 



Leu Val Thr Val 
40 

Gly His Ser Asp 
55 

Arg Ser Asp Asp 
70 

Ser Ser Ser Gly 
85 

Ser Asp Ser Glu 

Ser Asp Thr Ser 
120 

Ala Asp Gly Ser 
135 

Gin Arg Ser Ala 
150 

Ser Ser Asp Ser 
165 

Ala Asp Asp Ser 

Asp Trp Gin Met 
200 

Ser Glu Ser Ala 
215 



67 - 
25 

Asp Phe Leu Phe 

Glu Glu Ser Val 
60 

Asp Ser Ala Gly 

75 

Ser Ser Ser Asp 
90 

Ser Gly Ser Glu 
105 

Arg Glu Lys Lys 

Glu Phe Trp Lys 
140 

Val Leu Lys Lys 
155 

Gly Ser Glu Glu 
170 

Ser Ser Glu Thr 
185 

Ser Gly Ser Gly 

Glu Asp Gly Asp 

220 



30 

Asn Phe Leu He 
45 

Arg Asn Ser Ser 

Ser Ala Ser Gly 
80 

Gly Ser Ser Ser 
95 

Ser Gly Ser Gin 
110 

Gin Val Gin Ala 
125 

Ser Ser Pro Ser 

Gin Gin Gin Gin 
160 

Asp Ser Ser Ser 
175 

Lys Lys Lys Lys 
190 

Ser Val Ser Gly 
205 

Lys Ser Ser Cys 
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Glu Glu Ser Glu Ser Asp Tyr Glu Pro Lys Asn Lys Val Lys Ser Arg 
225 230 235 240 

Lys Pro Pro Ser Arg lie Lys Pro Lys Ser Gly Lys Lys Ser Thr Gly 

245 250 255 

Gin Lys Lys Arg Gin Leu Asp Ser Ser Glu Glu Glu Glu Asp Asp Asp 

260 265 270 

Glu Asp Tyr Asp Lys Arg Gly Ser Arg Arg Gin Ala Thr Val Asn Val 

275 280 285 

Ser Tyr Lys Glu Ala Glu Glu Thr Lys Thr Asp Ser Asp Asp Leu Leu 

290 295 300 

Glu Val Cys Gly Glu Asp Val Pro Gin Thr Glu Glu Asp Glu Phe Glu 
305 310 315 320 

Thr lie Glu Lys Phe Met Asp Ser Arg lie Gly Arg Lys Gly Ala Thr 

325 330 335 

Gly Ala Ser Thr Thr lie Tyr Ala Val Glu Ala Asp Gly Asp Pro Asn 

340 345 350 

Ala Gly Phe Glu Lys Ser Lys Glu Leu Gly Glu lie Gin Tyr Leu lie 

355 360 365 

Lys Trp Lys Gly Trp Ser His lie His Asn Thr Trp Glu Thr Glu Glu 

370 375 380 

Thr Leu Lys Gin Gin Asn Val Lys Gly Met Asn Lys Leu Asp Asn Tyr 
385 390 395 400 

Lys Lys Lys Asp Gin Glu Thr Lys Arg Trp Leu Lys Asn Ala Ser Pro 

405 410 415 

Glu Asp Val Glu Tyr Tyr Asn Cys Gin Gin Glu Leu Thr Asp Asp Leu 



420 

His Lys Gin Tyr 
435 

Lys Ser Ala Ala 
450 

Pro Tyr Ser Glu 
465 

Phe Gin Ala Arg 

Thr Pro Phe Lys 
500 

Ala Leu Lys Lys 
515 

Leu Arg Asp Tyr 
530 

Cys Lys Gly Asn 
545 

Thr lie Gin Thr 

Leu Tyr Gly Pro 
580 

Trp Gin Arg Glu 
595 

Tyr Leu Gly Asp 
610 



Gin lie Val Glu 
440 

Gly Tyr Pro Asp 
455 

Cys Ser Trp Glu 
470 

lie Asp Glu Tyr 
485 

Asp Cys Lys Val 

Gin Pro Ser Tyr 
520 

Gin Leu Asn Gly 

535 

Ser Cys lie Leu 
550 

lie Ser Phe Leu 
565 

Phe Leu Leu Arg 

lie Gin Thr Trp 
600 

lie Thr Ser Arg 
615 



I 

69 - 
425 

Arg lie lie Ala 

Tyr Tyr Cys Lys 
460 

Asp Gly Ala Leu 

475 

Phe Ser Arg Asn 
490 

Leu Lys Gin Arg 
505 

lie Gly Gly His 

Leu Asn Trp Leu 
540 

Ala Asp Glu Met 

555 

Asn Tyr Leu Phe 
570 

Val Pro Leu Ser 
585 

Ala Pro Gin Met 

Asn Met lie Arg 
620 



430 

His Ser Asn Gin 
445 

Trp Gin Gly Leu 

lie Ala Lys Lys 
480 

Gin Ser Lys Thr 
495 

Pro Arg Phe Val 
510 

Glu Ser Leu Glu 
525 

Ala His Ser Trp 

Gly Leu Gly Lys 
560 

His Glu His Gin 
575 

Thr Leu Thr Ser 
590 

Asn Ala Val Val 
605 

Thr His Glu Trp 



Met His Pro Gin 

625 

Tyr Glu lie Leu 

Ala Phe He Gly 
660 

Leu Leu Tyr Arg 
675 

He Thr Gly Thr 
690 

Leu His Phe He 
705 

Glu Glu His Gly 

Glu Leu Glu Pro 
740 

Ser Leu Pro Ala 

755 

Leu Gin Lys Gin 
770 

Leu Ser Lys Gly 

785 

Met Glu Leu Lys 
Asp Asp Asn Glu 



Thr Lys Arg Leu 
630 

Leu Lys Asp Lys 
645 

Val Asp Glu Ala 

Thr Leu He Asp 
680 

Pro Leu Gin Asn 

695 

Met Pro Glu Lys 
710 

Lys Gly Arg Glu 
725 

Phe Leu Leu Arg 

Lys Val Glu Gin 
760 

Tyr Tyr Lys Trp 
775 

Ser Lys Gly Ser 
790 

Lys Cys Cys Asn 
805 

Phe Tyr Asn Lys 



Lys Phe Asn He 
635 

Ser Phe Leu Gly 
650 

His Arg Leu Lys 
665 

Phe Lys Ser Asn 

Ser Leu Lys Glu 
700 

Phe Ser Ser Trp 
715 

Tyr Gly Tyr Ala 
730 

Arg Val Lys Lys 
745 

He Leu Arg Met 

He Leu Thr Arg 
780 

Thr Ser Gly Phe 

795 

His Cys Tyr Leu 
810 

Gin Glu Ala Leu 



Leu Leu Thr Thr 
640 

Gly Leu Asn Trp 
655 

Asn Asp Asp Ser 
670 

His Arg Leu Leu 
685 

Leu Trp Ser Leu 

Glu Asp Phe Glu 
720 

Ser Leu His Lys 
735 

Asp Val Glu Lys 
750 

Glu Met Ser Ala 
765 

Asn Tyr Lys Ala 

Leu Asn lie Met 
800 

He Lys Pro Pro 
815 

Gin His Leu He 



820 

Arg Ser Ser Gly 
835 

Arg Glu Arg Gly 
850 

Leu Asp lie Leu 
865 

Arg Leu Asp Gly 

His Phe Asn Ala 
900 

Arg Ala Gly Gly 
915 

lie Phe Asp Ser 
930 

Arg Ala His Arg 
945 

Val Thr Lys Gly 

Lys Met Val Leu 
980 

Lys Thr Val Leu 
995 

Asn Lys Glu Glu 
1010 



Lys Leu lie Leu 
840 

Asn Arg Val Leu 
855 

Ala Glu Tyr Leu 
870 

Ser lie Lys Gly 
885 

Glu Gly Ser Glu 

Leu Gly lie Asn 
920 

Asp Trp Asn Pro 
935 

lie Gly Gin Lys 
950 

Ser Val Glu Glu 
965 

Asp His Leu Val 

His Thr Gly Ser 
100 

Leu Ser Ala lie 
1015 



825 

Leu Asp Lys Leu 

lie Phe Ser Gin 
860 

Lys Tyr Arg Gin 

875 

Glu Leu Arg Lys 
890 

Asp Phe Cys Phe 
905 

Leu Ala Ser Ala 

Gin Asn Asp Leu 
940 

Lys Gin Val Asn 
955 

Asp lie Leu Glu 
970 

lie Gin Arg Met 
985 

Ala Pro Ser Ser 

i 

Leu Lys Phe Gly 
102 



830 

Leu lie Arg Leu 
845 

Met Val Arg Met 

Phe Pro Phe Gin 
880 

Gin Ala Leu Asp 
895 

Leu Leu Ser Thr 
910 

Asp Thr Val Val 
925 

Gin Ala Gin Ala 

lie Tyr Arg Leu 
960 

Arg Ala Lys Lys 
975 

Asp Thr Thr Gly 
990 

Ser Thr Pro Phe 
1005 

Ala Glu Glu Leu 



Phe Lys Glu Pro Glu Gly Glu Glu Gin Glu Pro Gin Glu Met Asp lie 
1025 1030 1035 1040 

Asp Glu lie Leu Lys Arg Ala Glu Thr His Glu Asn Glu Pro Gly Pro 

1045 1050 1055 

Leu Ser Val Gly Asp Glu Leu Leu Ser Gin Phe Lys Val Ala Asn Phe 

1060 1065 1070 

Ser Asn Met Asp Glu Asp Asp lie Glu Leu Glu Pro Glu Arg Asn Ser 

1075 1080 1085 

Lys Asn Trp Glu Glu lie lie Pro Glu Glu Gin Arg Arg Arg Leu Glu 

1090 1095 1100 

Glu Glu Glu Arg Gin Lys Glu Leu Glu Glu lie Tyr Met Leu Pro Arg 
1105 1110 1115 1120 

Met Arg Asn Cys Ala Lys Gin lie Ser Phe Asn Gly Ser Glu Gly Arg 

1125 1130 1135 

Arg Ser Arg Ser Arg Arg Tyr Ser Gly Ser Asp Ser Asp Ser lie Ser 

1140 1145 1150 

Glu Arg Lys Arg Pro Lys Lys Arg Gly Arg Pro Arg Thr lie Pro Arg 

1155 1160 1165 

Glu Asn lie Lys Gly Phe Ser Asp Ala Glu lie Arg Arg Phe lie Lys 

1170 1175 1180 

Ser Tyr Lys Lys Phe Gly Gly Pro Leu Glu Arg Leu Asp Ala lie Ala 
1185 1190 1195 1200 

Arg Asp Ala Glu Leu Val Asp Lys Ser Glu Thr Asp Leu Arg Arg Leu 

1205 1210 1215 

Gly Glu Leu Val His Asn Gly Cys Val Lys Ala Leu Lys Asp Ser Ser 



1220 1225 1230 

Ser Gly Thr Glu Arg Ala Gly Gly Arg Leu Gly Lys Val Lys Gly Pro 

1235 1240 1245 

Thr Phe Arg lie Ser Gly Val Gin Val Asn Ala Lys Leu Val lie Ala 

1250 1255 1260 

His Glu Asp Glu Leu lie Pro Leu His Lys Ser lie Pro Ser Asp Pro 
1265 1270 1275 1280 

Glu Glu Arg Lys Gin Tyr Thr lie Pro Cys His Thr Lys Ala Ala His 

1285 1290 1295 

Phe Asp lie Asp Trp Gly Lys Glu Asp Asp Ser Asn Leu Leu lie Gly 

1300 1305 1310 

lie Tyr Glu Tyr Gly Tyr Gly Ser Trp Glu Met lie Lys Met Asp Pro 

1315 1320 1325 

Asp Leu Ser Leu Thr His Lys lie Leu Pro Asp Asp Pro Asp Lys Lys 

1330 1335 1340 

Pro Gin Ala Lys Gin Leu Gin Thr Arg Ala Asp Tyr Leu lie Lys Leu 
1345 1350 1355 1360 

Leu Ser Arg Asp Leu Ala Lys Arg Glu Ala Gin Arg Leu Cys Gly Ala 

1365 1370 1375 

Gly Gly Ser Lys Arg Arg Lys Thr Arg Ala Lys Lys Ser Lys Ala Met 

1380 1385 1390 

Lys Ser lie Lys Val Lys Glu Glu lie Lys Ser Asp Ser Ser Pro Leu 

1395 1400 1405 

Pro Ser Glu Lys Ser Asp Glu Asp Asp Asp Lys Leu Asn Asp Ser Lys 
1410 1415 1420 
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Pro Glu Ser Lys Asp Arg Ser Lys Lys Ser Val Val Ser Asp Ala Pro 
1425 1430 1435 1440 

Val His lie Thr Ala Ser Gly Glu Pro Val Pro lie Ala Glu Glu Ser 

1445 1450 1455 

Glu Glu Leu Asp Gin Lys Thr Phe Ser lie Cys Lys Glu Arg Met Arg 

1460 1465 1470 

Pro Val Lys Ala Ala Leu Lys Gin Leu Asp Arg Pro Glu Lys Gly Leu 

1475 1480 1485 

Ser Glu Arg Glu Gin Leu Glu His Thr Arg Gin Cys Leu lie Lys lie 

1490 1495 1500 

Gly Asp His lie Thr Glu Cys Leu Lys Glu Tyr Ser Asn Pro Glu Gin 
1505 1510 1515 1520 

lie Lys Gin Trp Arg Lys Asn Leu Trp lie Phe Val Ser Lys Phe Thr 

1525 1530 1535 

Glu Phe Asp Ala Arg Lys Leu His Lys Leu Tyr Lys His Ala lie Lys 

1540 1545 1550 

Lys Arg Gin Glu Ser Gin Gin Asn Ser Asp Gin Asn Ser Asn Val Ala 

1555 1560 1565 

Thr Thr His Val lie Arg Asn Pro Asp Met Glu Arg Leu Lys Glu Asn 

1570 1575 1580 

Thr Asn His Asp Asp Ser Ser Arg Asp Ser Tyr Ser Ser Asp Arg His 
1585 1590 1595 1600 

Leu Ser Gin Tyr His Asp His His Lys Asp Arg His Gin Gly Asp Ser 

1605 1610 1615 

Tyr Lys Lys Ser Asp Ser Arg Lys Arg Pro Tyr Ser Ser Phe Ser Asn 
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1620 1625 1630 

Gly Lys Asp His Arg Glu Trp Asp His Tyr Arg Gin Asp Ser Arg Tyr 

1635 1640 1645 

Tyr Ser Asp Arg Glu Lys His Arg Lys Leu Asp Asp His Arg Ser Arg 

1650 1655 1660 

Glu His Arg Pro Ser Leu Glu Gly Gly Leu Lys Asp Leu Asp Gin Arg 
1665 1670 1675 1680 

Ser Pro Tyr Gly Ser Arg Ser Pro Phe Glu His Ser Ala Glu His Arg 

1685 1690 1695 

Ser Thr Pro Glu His Thr Trp Ser Ser Arg Lys Thr Xaa Gin Lys Leu 

1700 1705 1710 

Met Ser Leu Ser Ser Gly Thr Leu Phe Xaa Pro Leu Thr Xaa Leu Glu 

1715 1720 1725 

Arg Tyr Gly Leu Asp lie Leu Ser Val Ala Val Leu Leu Leu Leu Ser 

1730 1735 1740 

Arg Met Gin Gly Leu Leu Ser Gin Gin Lys Lys Asn lie Phe Val Phe 
1745 1750 1755 1760 

Lys Val Tyr Ala Ala Leu Cys Cys Lys Cys Cys Gly Thr Phe Phe Leu 

1765 1770 1775 

Arg Asn Gly Arg Cys Leu Leu Leu Gin Gly Pro Gin His Cys Pro Phe 

1780 1785 1790 

Gin Thr Gly Ser Tyr Tyr Lys Thr Leu His Val Lys Val Val Leu Gly 

1795 1800 1805 

Xaa Thr Gin lie Lys Leu Cys Leu Xaa Met Asn Thr Xaa Thr Leu Thr 
1810 1815 1820 
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Cys Ala Tyr Val Ser Gly Lys Asn Gly Gly Phe lie Leu Phe Tyr Phe 
1825 1830 1835 1840 

Leu Val Glu Asn Ser Gin Gly Leu Cys Ser Leu Ser Lys Ala Thr Cys 

1845 1850 1855 

Leu His Cys Thr Leu Arg Pro Pro Cys Arg Phe Ser Ser Gin Ala Xaa 

1860 1865 1870 

lie Phe Lys Phe Cys Thr Tyr Ser Cys Lys lie Ala Arg lie Ser Pro 

1875 1880 1885 

Val Cys Asp Gin Leu Xaa Cys Leu Phe Met Lys Gin Thr Asn Lys Gin 

1890 1895 1900 

Lys Thr lie Lys Lys Lys Asn Thr Thr Lys Pro Thr Asn Gly Cys Lys 
1905 1910 1915 1920 

Leu Leu Xaa lie Asn Xaa Met Ser Phe Phe Pro Ser Gly Phe Phe Trp 

1925 1930 1935 

Leu Phe Leu Ser Pro Thr Thr Gin Ala Phe Phe Ser Gin Ser Gin Tyr 

1940 1945 1950 

Thr Tyr Met Phe Xaa Xaa Asn lie Ser Met Glu Ser Glu Cys Lys Asn 

1955 1960 1965 

Gly Glu Gly Asn lie Leu Phe His Leu Val Leu Leu Phe Tyr Trp lie 

1970 1975 1980 

Leu Leu His Thr Cys Phe Trp Leu Phe Tyr Phe lie Phe Phe Phe Tyr 
1985 1990 1995 2000 

Xaa Thr Val Ser Val Val He Val Val Met Asn Ser Glu Asn He Pro 

2005 2010 2015 

Leu Xaa Thr Val Pro Trp Lys Ala Phe Gin Val His Trp Phe Lys Arg 
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2020 2025 2030 

Arg Lys Cys Ser lie Gly Glu His Phe Lys Thr Gin lie Ser Gin Asp 

2035 2040 2045 

Ser Leu Xaa lie His Leu Phe Ser Leu Phe Asn Met Gly Asn Asn Val 

2050 2055 2060 

Lys Cys Ala Met Gin Gin Leu lie Phe Xaa Lys lie Xaa Met Thr Leu 
2065 2070 2075 2080 

Leu Thr Glu Leu Leu Gin Cys Thr Leu lie Val His Arg Xaa Leu Leu 

2085 2090 2095 

Ser Asp Lys Leu Asn Xaa Leu Lys Pro Lys Lys Thr 
2100 2105 



(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1795 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Phe Ala Leu Cys Pro Pro Val Thr Gin Arg Glu Pro Gin Glu Thr Arg 

15 10 15 

Glu Cys Arg Lys Phe lie Phe Glu lie Leu lie Phe Glu Glu lie Cys 

20 25 30 

lie His Thr His Leu Leu Leu lie Gly Asp Phe Cys Phe lie Asn Phe 
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35 

Leu lie Phe Thr 
50 

Gly Ser Gly Glu 
65 

Gly Ser Gly Ser 

Ser Gin Ser Gly 
100 

Gin Ser Glu Ser 
115 

Lys Pro Pro Lys 
130 

lie Leu Ala Val 
145 

Ala Gin Gin Gin 

Ser Ser Ser Ser 
180 

Lys Lys His Asn 
195 

Ser Gin Leu Gly 
210 

Ser Cys Asp Gly 
225 



40 

Met Asn Gly His 

55 

Ser Ser Gin Ser 
70 

Gly Ser Ser Ser 
85 

Ser Ser Asp Ser 

Glu Ser Asp Thr 
120 

Val Asp Gly Ala 
135 

Gin Arg Ser Ala 
150 

Arg Pro Ala Ser 
165 

Glu Asp Ser Asp 

Asp Glu Asp Trp 
200 

Ser Asp Ser Glu 
215 

Thr Glu Ser Asp 
230 



Ser Asp Glu Glu 
60 

Gly Asp Asp Cys 
75 

Gly Ser Ser Ser 
90 

Asp Ser Gly Ser 
105 

Ser Arg Glu Asn 

Glu Phe Trp Lys 
140 

Met Leu Arg Lys 
155 

Ser Asn Ser Gly 
170 

Asp Ser Ser Ser 
185 

Gin Met Ser Gly 

Ser Glu Glu Glu 
220 

Tyr Glu Pro Lys 

235 



45 

Ser Val Arg Asn 

Gly Ser Ala Ser 
80 

Asp Gly Ser Ser 
95 

Asp Ser Gly Ser 
110 

Lys Val Gin Ala 
125 

Ser Ser Pro Ser 

Gin Pro Gin Gin 
160 

Ser Glu Glu Asp 
175 

Gly Ala Lys Arg 
190 

Ser Gly Ser Pro 
205 

Arg Asp Lys Ser 

Asn Lys Val Arg 
240 



Ser Arg Lys Pro 

Leu Gly Gin Lys 
260 

Glu Asp Tyr Asp 
275 

Asn Val Ser Tyr 
290 

Leu Leu Glu Val 
305 

Phe Glu Thr lie 

Ala Thr Gly Ala 
340 

Pro Asn Ala Gly 
355 

Leu lie Lys Trp 
370 

Glu Glu Thr Leu 

385 

Asn Tyr Lys Lys 

Ser Pro Glu Asp 
420 

Asp Leu His Lys 



Gin Asn Arg Ser 
245 

Lys Arg Gin lie 

Asn Asp Lys Arg 
280 

Lys Glu Asp Glu 

295 

Cys Gly Glu Asp 
310 

Glu Arg Val Met 
325 

Thr Thr Thr lie 

Phe Glu Arg Asn 
360 

Lys Gly Trp Ser 
375 

Lys Gin Gin Asn 
390 

Lys Asp Gin Glu 
405 

Val Glu Tyr Tyr 
Gin Tyr Gin lie 



Lys Ser Lys Asn 
250 

Asp Ser Ser Glu 
265 

Ser Ser Arg Arg 

Glu Met Lys Thr 
300 

Val Pro Gin Pro 
315 

Asp Cys Arg Val 
330 

Tyr Ala Val Glu 
345 

Lys Glu Pro Gly 

His lie His Asn 
380 

Val Arg Gly Met 

395 

Thr Lys Arg Trp 
410 

Asn Cys Gin Gin 
425 

Val Glu Arg lie 



Gly Lys Lys lie 

255 

Asp Glu Asp Asp 
270 

Gin Ala Thr Val 
285 

Asp Ser Asp Asp 

Glu Asp Glu Glu 
320 

Gly Arg Lys Gly 

335 

Ala Asp Gly Asp 
350 

Asp lie Gin Tyr 
365 

Thr Trp Glu Thr 

Lys Lys Leu Asp 
400 

Leu Lys Asn Ala 
415 

Glu Leu Thr Asp 
430 

lie Ala His Ser 
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435 440 445 

Asn Gin Lys Ser Ala Ala Gly Leu Pro Asp Tyr Tyr Cys Lys Trp Gin 

450 455 460 

Gly Leu Pro Tyr Ser Glu Cys Ser Trp Glu Asp Gly Ala Leu lie Ser 
465 470 475 480 

Lys Lys Phe Gin Thr Cys lie Asp Glu Tyr Phe Ser Arg Asn Gin Ser 

485 490 495 

Lys Thr Thr Pro Phe Lys Asp Cys Lys Val Leu Lys Gin Arg Pro Arg 

500 505 510 

Phe Val Ala Leu Lys Lys Gin Pro Ser Tyr lie Gly Gly His Glu Gly 

515 520 525 

Leu Glu Leu Arg Asp Tyr Gin Leu Asn Gly Leu Asn Trp Leu Ala His 

530 535 540 

Ser Trp Cys Lys Gly Asn Ser Cys lie Leu Ala Asp Glu Met Gly Leu 
545 550 555 560 

Gly Lys Thr lie Gin Thr lie Ser Phe Leu Asn Tyr Leu Phe His Glu 

565 570 575 

His Gin Leu Tyr Gly Pro Phe Leu Leu Val Val Pro Leu Ser Thr Leu 

580 585 590 

Thr Ser Trp Gin Arg Glu lie Gin Thr Trp Ala Ser Gin Met Asn Ala 

595 600 605 

Val Val Tyr Leu Gly Asp lie Asn Ser Arg Asn Met lie Arg Thr His 

610 615 620 

Glu Trp Met His Pro Gin Thr Lys Arg Leu Lys Phe Asn lie Leu Leu 
625 630 635 640 



Thr Thr Tyr Glu 

Asn Trp Ala Phe 
660 

Asp Ser Leu Leu 
675 

Leu Leu lie Thr 
690 

Ser Leu Leu His 
705 

Phe Glu Glu Glu 

His Lys Glu Leu 
740 

Glu Lys Ser Leu 
755 

Ser Ala Leu Gin 
770 

Lys Ala Leu Ser 
785 

He Met Met Glu 

Pro Pro Asp Asn 
820 

Leu He Arg Ser 



He Leu Leu Lys 
645 

He Gly Val Asp 

Tyr Lys Thr Leu 
680 

Gly Thr Pro Leu 
695 

Phe He Met Pro 
710 

His Gly Lys Gly 
725 

Glu Pro Phe Leu 

Pro Ala Lys Val 
760 

Lys Gin Tyr Tyr 

775 

Lys Gly Ser Lys 
790 

Leu Lys Lys Cys 
805 

Asn Glu Phe Tyr 
Ser Gly Lys Leu 



Asp Lys Ala Phe 
650 

Glu Ala His Arg 
665 

He Asp Phe Lys 

Gin Asn Ser Leu 
700 

Glu Lys Phe Ser 
715 

Arg Glu Tyr Gly 
730 

Leu Arg Arg Val 
745 

Glu Gin He Leu 

Lys Trp He Leu 
780 

Gly Ser Thr Ser 

795 

Cys Asn His Cys 
810 

Asn Lys Gin Glu 
825 

lie Leu Leu Asp 



Leu Gly Gly Leu 
655 

Leu Lys Asn Asp 
670 

Ser Asn His Arg 
685 

Lys Glu Leu Trp 

Ser Trp Glu Asp 
720 

Tyr Ala Ser Leu 
735 

Lys Lys Asp Val 
750 

Arg Met Glu Met 

765 

Thr Arg Asn Tyr 

Gly Phe Leu Asn 
800 

Tyr Leu He Lys 
815 

Ala Leu Gin His 
830 

Lys Leu Leu He 
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835 840 845 

Arg Leu Arg Glu Arg Gly Asn Arg Val Leu lie Phe Ser Gin Met Val 

850 855 860 

Arg Met Leu Asp lie Leu Ala Glu Tyr Leu Lys Tyr Arg Gin Phe Pro 
865 870 875 880 

Phe Gin Arg Leu Asp Gly Ser lie Lys Gly Glu Leu Arg Lys Gin Ala 

885 890 895 

Leu Asp His Phe Asn Ala Glu Gly Ser Glu Asp Phe Cys Phe Leu Leu 

900 905 910 

Ser Thr Arg Ala Gly Gly Leu Gly lie Asn Leu Ala Ser Ala Asp Thr 

915 920 925 

Val Val lie Phe Asp Ser Asp Trp Asn Pro Gin Asn Asp Leu Gin Ala 

930 935 940 

Gin Ala Arg Ala His Arg lie Gly Gin Lys Lys Gin Val Asn lie Tyr 
945 950 955 960 

Arg Leu Val Thr Lys Gly Ser Val Glu Glu Asp lie Leu Glu Arg Ala 

965 970 975 

Lys Lys Lys Met Val Leu Asp His Leu Val lie Gin Arg Met Asp Thr 

980 985 990 

Thr Gly Lys Thr Val Leu His Thr Gly Ser Thr Pro Ser Ser Ser Thr 

995 1000 1005 

Pro Phe Asn Lys Glu Glu Leu Ser Ala lie Leu Lys Phe Gly Ala Glu 

1010 1015 1020 

Glu Leu Phe Lys Glu Pro Glu Gly Glu Glu Gin Glu Pro Gin Glu Met 
1025 1030 1035 1040 
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Asp lie Asp Glu lie Leu Lys Arg Ala Glu Thr Arg Glu Asn Glu Pro 

1045 1050 1055 

Gly Pro Leu Thr Val Gly Asp Glu Leu Leu Ser Gin Phe Lys Val Ala 

1060 1065 1070 

Asn Phe Ser Asn Met Asp Glu Asp Asp lie Glu Leu Glu Pro Glu Arg 

1075 1080 1085 

Asn Ser Arg Asn Trp Glu Glu lie lie Pro Glu Ser Gin Arg Arg Arg 

1090 1095 1100 

lie Glu Glu Glu Glu Arg Gin Lys Glu Leu Glu Glu lie Tyr Met Leu 
1105 1110 1115 1120 

Pro Arg Met Arg Asn Cys Ala Lys Gin lie Ser Phe Asn Gly Ser Glu 

1125 1130 1135 

Gly Arg Arg Ser Arg Ser Arg Arg Tyr Ser Gly Ser Asp Ser Asp Ser 

1140 1145 1150 

lie Thr Glu Arg Lys Arg Pro Lys Lys Arg Gly Arg Pro Arg Thr lie 

1155 1160 1165 

Pro Arg Glu Asn lie Lys Gly Phe Ser Asp Ala Glu lie Arg Arg Phe 

1170 1175 1180 

lie Lys Ser Tyr Lys Lys Phe Gly Gly Pro Leu Glu Arg Leu Asp Ala 
1185 1190 1195 1200 

Val Ala Arg Asp Ala Glu Leu Val Asp Lys Ser Glu Thr Asp Leu Arg 

1205 1210 1215 

Arg Leu Gly Glu Leu Val His Asn Gly Cys lie Lys Ala Leu Lys Asp 

1220 1225 1230 

Asn Ser Ser Gly Gin Glu Arg Ala Gly Gly Arg Leu Gly Lys Val Lys 
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1235 1240 1245 

Gly Pro Thr Phe Arg lie Ser Gly Val Gin Val Asn Ala Lys Leu Val 

1250 1255 1260 

lie Ser His Glu Glu Glu Leu Ala Pro Leu His Lys Ser lie Pro Ser 
1265 1270 1275 1280 

Asp Pro Glu Glu Arg Lys Arg Tyr Val lie Pro Cys His Thr Lys Ala 

1285 1290 1295 

Ala His Phe Asp lie Asp Trp Gly Lys Glu Asp Asp Ser Asn Leu Leu 

1300 1305 1310 

Val Gly lie Tyr Glu Tyr Gly Tyr Gly Ser Trp Glu Met lie Lys Met 

1315 1320 1325 

Asp Pro Asp Leu Ser Leu Thr Gin Lys lie Leu Pro Asp Asp Pro Asp 

1330 1335 1340 

Lys Lys Pro Gin Ala Lys Gin Leu Gin Thr Arg Ala Asp Tyr Leu lie 
1345 1350 1355 1360 

Lys Leu Leu Asn Lys Asp Leu Ala Arg Lys Glu Ala Gin Arg Leu Ala 

1365 1370 1375 

Gly Ala Gly Asn Ser Lys Arg Arg Lys Thr Arg Asn Lys Lys Asn Lys 

1380 1385 1390 

Met Lys Ala Ser Lys lie Lys Glu Glu lie Lys Ser Asp Ser Ser Pro 

1395 1400 1405 

Gin Pro Ser Glu Lys Ser Asp Glu Asp Asp Glu Glu Glu Asp Asn Lys 

1410 1415 1420 

Val Asn Glu Met Lys Ser Glu Asn Lys Glu Lys Ser Lys Lys lie Pro 
1425 1430 1435 1440 



Leu Leu Asp Thr Pro Val His lie Thr Ala Thr Ser Glu Pro Val Pro 

1445 1450 1455 

lie Ser Glu Glu Ser Glu Glu Leu His Gin Lys Thr Phe Ser Val Cys 

1460 1465 1470 

Lys Glu Arg Met Arg Pro Val Lys Ala Ala Leu Lys Gin Leu Asp Arg 

1475 1480 1485 

Pro Glu Lys Gly Leu Ser Glu Arg Glu Gin Leu Glu His Thr Arg Gin 

1490 1495 1500 

Cys Leu lie Lys lie Gly Asp His lie Thr Glu Cys Leu Lys Glu Tyr 
1505 1510 1515 1520 

Thr Asn Pro Glu Gin lie Lys Gin Trp Arg Lys Asn Leu Trp lie Phe 

1525 1530 1535 

Val Ser Lys Phe Thr Glu Phe Asp Ala Arg Lys Leu His Lys Leu Tyr 

1540 1545 1550 

Lys His Ala lie Lys Lys Arg Gin Glu Ser Gin Gin His Asn Asp Gin 

1555 1560 1565 

Asn lie Ser Ser Asn Val Asn Thr His Val lie Arg Asn Pro Asp Val 

1570 1575 1580 

Glu Arg Leu Lys Glu Thr Thr Asn His Asp Asp Ser Ser Arg Asp Ser 
1585 1590 1595 1600 

Tyr Ser Ser Asp Arg His Leu Ser Gin Tyr His Asp His His Lys Asp 

1605 1610 1615 

Arg His Gin Gly Asp Ala Tyr Lys Lys Ser Asp Ser Arg Lys Arg Pro 

1620 1625 1630 

Tyr Ser Ala Phe Ser Asn Gly Lys Asp His Arg Asp Trp Asp His Tyr 
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1635 1640 1645 

Lys Gin Asp Ser Arg Tyr Tyr Ser Asp Ser Lys His Arg Lys Leu Asp 

1650 1655 1660 

Asp His Arg Ser Arg Asp His Arg Ser Asn Leu Glu Gly Asn Leu Lys 
1665 1670 1675 1680 

Asp Ser Arg Gly His Ser Asp His Arg Ser His Ser Asp His Afg lie 

1685 1690 1695 

His Ser Asp His Arg Ser Thr Ser Glu Tyr Ser His His Lys Ser Ser 

1700 1705 1710 

Arg Asp Tyr Arg Tyr His Ser Asp Trp Gin Met Asp His Arg Ala Ser 

1715 1720 1725 

Gly Ser Gly Pro Arg Ser Pro Leu Asp Gin Arg Ser Pro Tyr Gly Ser 

1730 1735 1740 

Arg Ser Pro Leu Gly His Arg Ser Pro Phe Glu His Ser Ser Asp His 
1745 1750 1755 1760 

Lys Ser Thr Pro Glu His Thr Trp Ser Ser Arg Lys Thr Xaa Gin Arg 

1765 1770 1775 

Leu Thr Phe Ser Gly Pro Ser Phe Xaa Pro Tyr Thr Val Asn Xaa His 
1780 1785 1790 

Ser Asn Cys 
1795 

(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 



- 87 - 

(B) TYPE: amino acid 

(C) STRAND EDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Lys Thr Xaa Glu Pro Gly Glu lie Gin Tyr Leu lie Lys Trp Lys Gly 

15 10 15 

Trp Ser His lie His Asn Thr Trp Glu Thr Glu Glu Thr Leu Lys Gin 

20 25 30 

Gin Asn Val Arg Gly Met Lys Lys Leu Asp Asn Tyr Lys Lys Lys Asp 

35 40 45 

Gin Glu Thr Lys Arg Trp Leu Lys Asn Ala Ser Pro Glu Asp Val Glu 

50 55 60 

Tyr Tyr Asn Cys Gin Gin Glu Leu Thr Asp Asp Leu His Lys Gin Tyr 
65 70 75 80 

Gin lie Val Glu Arg Thr Asn Xaa Ser Phe Gin Ser Lys Ser Ala Ala 

85 90 95 

Gly Tyr Pro 

INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 439 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

lie Tyr Arg Leu Val Thr Lys Gly Ser Val Glu Glu Asp lie Leu Glu 

15 10 15 

Arg Ala Lys Lys Lys Met Val Leu Asp His Leu Val lie Gin Arg Met 

20 25 30 

Asp Thr Thr Gly Lys Thr Val Leu His Thr Gly Ser Thr Pro Ser Ser 

35 40 45 

Ser Thr Pro Phe Asn Lys Glu Glu Leu Ser Ala lie Leu Lys Phe Gly 

50 55 60 

Ala Glu Glu Leu Phe Lys Glu Pro Glu Xaa Glu Glu Glu Glu Pro Gin 
65 70 75 80 

Glu Met Asp lie Asp Glu lie Leu Lys Arg Xaa Glu Thr Arg Glu Asn 

85 90 95 

Glu Ser Gly Pro Leu Thr Val Gly Asp Glu Leu Leu Ser Gin Phe Lys 

100 105 110 

Val Ala Asn Phe Ser Asn Met Asp Glu Asp Asp lie Glu Leu Glu Pro 

115 120 125 

Glu Gin Asn Leu Arg Asn Trp Glu Glu lie lie Pro Glu Val Gin Trp 

130 135 140 

Arg Arg lie Glu Gly Xaa Glu Arg Gin Lys Glu Leu Glu Glu lie Tyr 
145 150 155 160 

Met Leu Pro Arg Met Arg Asn Cys Ala Lys Gin lie Ser Phe Asn Gly 

165 170 175 

Asn Glu Gly Arg Cys Ser Arg Ser Arg Arg Tyr Ser Gly Ser Asp Ser 



180 

Asp Ser lie Ser 
195 

Thr lie Pro Arg 
210 

Arg Phe lie Lys 

225 

Asp Ala lie Ala 

Leu Arg Arg Leu 
260 

Asn Asp Asn Asp 

275 

Val Lys Gly Pro 
290 

Leu Val lie Ser 
305 

Pro Ser Asp Pro 

Lys Ala Ala His 
340 

Leu Leu lie Gly 
355 

Lys Met Asp Pro 

370 



Glu Arg Lys Arg 
200 

Glu Asn lie Lys 
215 

Ser Tyr Lys Lys 
230 

Arg Asp Ala Glu 
245 

Gly Glu Leu Val 

Phe Gly Gin Gly 
280 

Thr Phe Arg lie 
295 

His Glu Glu Glu 
310 

Glu Glu Arg Lys 
325 

Phe Asp lie Asp 

lie Tyr Glu Tyr 
360 

Asp Leu Ser Leu 

375 
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185 

Pro Lys Lys Arg 

Gly Phe Ser Asp 
220 

Phe Gly Gly Pro 

235 

Leu Val Asp Lys 
250 

His Asn Gly Cys 
265 

Arg Thr Gly Gly 

Ala Gly Val Gin 
300 

Leu Ala Pro Leu 
315 

Arg Tyr Val lie 
330 

Trp Gly Lys Glu 
345 

Gly Tyr Gly Ser 

Thr Gin Lys lie 

380 



190 

Gly Arg Pro Arg 
205 

Ala Glu lie Arg 

Val Glu Arg Leu 
240 

Ser Glu Thr Asp 
255 

lie Lys Ala Leu 
270 

Arg Phe Gly Lys 
285 

Val Asn Ala Lys 

His Lys Ser lie 
320 

Pro Tyr His Thr 

335 

Asp Asp Ser Asn 
350 

Trp Glu Met lie 
365 

Leu Pro Asp Asp 
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Pro Asp Lys Lys Pro Gin Ala Lys Gin Leu Gin Thr Arg Ala Asp Tyr 
385 390 395 400 

Leu lie Lys Leu Leu Asn Lys Asp Leu Ala Arg Lys Glu Ala Gin Arg 

405 410 415 

Leu Ala Gly Ala Gly Asn Ser Lys Arg Arg Lys Thr Arg Ser Lys Lys 

420 425 430 

Asn Lys Ala Thr Lys Ala Ala 
435 



(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1434 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .1434 

(D) OTHER INFORMATION: /note= "The sequence beginning at 1 

corresponds to 55 and that ending at 1434 corresponds to 

1488" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Lys Pro Pro Lys Ala Asp Gly Ser Glu Phe Trp Lys Ser Ser Pro Ser 
15 10 15 



He Leu Ala Val 
20 

Gin Lys Ala Ala 
35 

Ser Glu Asp Ser 
50 

His Lys Asp Glu 

65 

Thr Gly Ser Asp 

Glu Glu Ser Glu 
100 

Lys Pro Pro Ser 
115 

Gin Lys Lys Arg 
130 

Glu Asp Tyr Asp 
145 

Ser Tyr Lys Glu 

Glu Val Cys Gly 
180 

Thr He Glu Lys 
195 

Gly Ala Ser Thr 



Gin Arg Ser Ala 

Ser Ser Asp Ser 
40 

Ala Asp Asp Ser 
55 

Asp Trp Gin Met 
70 

Ser Glu Ser Ala 
85 

Ser Asp Tyr Glu 

Arg He Lys Pro 
120 

Gin Leu Asp Ser 
135 

Lys Arg Gly Ser 
150 

Ala Glu Glu Thr 
165 

Glu Asp Val Pro 

Phe Met Asp Ser 
200 

Thr He Tyr Ala 
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Val Leu Lys Lys 
25 

Gly Ser Glu Glu 

Ser Ser Glu Thr 
60 

Ser Gly Ser Gly 
75 

Glu Asp Gly Asp 
90 

Pro Lys Asn Lys 
105 

Lys Ser Gly Lys 

Ser Glu Glu Glu 
140 

Arg Arg Gin Ala 
155 

Lys Thr Asp Ser 
170 

Gin Thr Glu Glu 
185 

Arg He Gly Arg 
Val Glu Ala Asp 



Gin Gin Gin Gin 

30 

Asp Ser Ser Ser 
45 

Lys Lys Lys Lys 

Ser Val Ser Gly 
80 

Lys Ser Ser Cys 
95 

Val Lys Ser Arg 
110 

Lys Ser Thr Gly 
125 

Glu Asp Asp Asp 

Thr Val Asn Val 
160 

Asp Asp Leu Leu 
175 

Asp Glu Phe Glu 
190 

Lys Gly Ala Thr 
205 

Gly Asp Pro Asn 
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210 

Ala Gly Phe Glu 

225 

Lys Trp Lys Gly 

Thr Leu Lys Gin 
260 

Lys Lys Lys Asp 
275 

Glu Asp Val Glu 
290 

His Lys Gin Tyr 
305 

Lys Ser Ala Ala 

Pro Tyr Ser Glu 
340 

Phe Gin Ala Arg 
355 

Thr Pro Phe Lys 
370 

Ala Leu Lys Lys 
385 

Leu Arg Asp Tyr 



215 

Lys Ser Lys Glu 
230 

Trp Ser His lie 
245 

Gin Asn Val Lys 

Gin Glu Thr Lys 
280 

Tyr Tyr Asn Cys 
295 

Gin lie Val Glu 
310 

Gly Tyr Pro Asp 
325 

Cys Ser Trp Glu 

lie Asp Glu Tyr 
360 

Asp Cys Lys Val 
375 

Gin Pro Ser Tyr 
390 

Gin Leu Asn Gly 
405 



220 

Leu Gly Glu lie 

235 

His Asn Thr Trp 
250 

Gly Met Asn Lys 
265 

Arg Trp Leu Lys 

Gin Gin Glu Leu 
300 

Arg lie lie Ala 
315 

Tyr Tyr Cys Lys 
330 

Asp Gly Ala Leu 
345 

Phe Ser Arg Asn 

Leu Lys Gin Arg 

380 

lie Gly Gly His 

395 

Leu Asn Trp Leu 
410 



Gin Tyr Leu lie 
240 

Glu Thr Glu Glu 
255 

Leu Asp Asn Tyr 
270 

Asn Ala Ser Pro 
285 

Thr Asp Asp Leu 

His Ser Asn Gin 
320 

Trp Gin Gly Leu 

335 

lie Ala Lys Lys 
350 

Gin Ser Lys Thr 
365 

Pro Arg Phe Val 

Glu Ser Leu Glu 
400 

Ala His Ser Trp 
415 
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Cys Lys Gly Asn Ser Cys lie Leu Ala Asp Glu Met Gly Leu Gly Lys 

420 425 430 

Thr lie Gin Thr lie Ser Phe Leu Asn Tyr Leu Phe His Glu His Gin 

435 440 445 

Leu Tyr Gly Pro Phe Leu Leu Arg Val Pro Leu Ser Thr Leu Thr Ser 

450 455 460 

Trp Gin Arg Glu lie Gin Thr Trp Ala Pro Gin Met Asn Ala Val Val 
465 470 475 480 

Tyr Leu Gly Asp lie Thr Ser Arg Asn Met lie Arg Thr His Glu Trp 

485 490 495 

Met His Pro Gin Thr Lys Arg Leu Lys Phe Asn lie Leu Leu Thr Thr 

500 505 510 

Tyr Glu lie Leu Leu Lys Asp Lys Ser Phe Leu Gly Gly Leu Asn Trp 

515 520 525 

Ala Phe lie Gly Val Asp Glu Ala His Arg Leu Lys Asn Asp Asp Ser 

530 535 540 

Leu Leu Tyr Arg Thr Leu lie Asp Phe Lys Ser Asn His Arg Leu Leu 
545 550 555 560 

lie Thr Gly Thr Pro Leu Gin Asn Ser Leu Lys Glu Leu Trp Ser Leu 

565 570 575 

Leu His Phe lie Met Pro Glu Lys Phe Ser Ser Trp Glu Asp Phe Glu 

580 585 590 

Glu Glu His Gly Lys Gly Arg Glu Tyr Gly Tyr Ala Ser Leu His Lys 

595 600 605 

Glu Leu Glu Pro Phe Leu Leu Arg Arg Val Lys Lys Asp Val Glu Lys 
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610 615 620 

Ser Leu Pro Ala Lys Val Glu Gin lie Leu Arg Met Glu Met Ser Ala 
625 630 635 640 

Leu Gin Lys Gin Tyr Tyr Lys Trp lie Leu Thr Arg Asn Tyr Lys Ala 

645 650 655 

Leu Ser Lys Gly Ser Lys Gly Ser Thr Ser Gly Phe Leu Asn lie Met 

660 665 670 

Met Glu Leu Lys Lys Cys Cys Asn His Cys Tyr Leu lie Lys Pro Pro 

675 680 685 

Asp Asp Asn Glu Phe Tyr Asn Lys Gin Glu Ala Leu Gin His Leu lie 

690 695 700 

Arg Ser Ser Gly Lys Leu lie Leu Leu Asp Lys Leu Leu lie Arg Leu 
705 710 715 720 

Arg Glu Arg Gly Asn Arg Val Leu lie Phe Ser Gin Met Val Arg Met 

725 730 735 

Leu Asp lie Leu Ala Glu Tyr Leu Lys Tyr Arg Gin Phe Pro Phe Gin 

740 745 750 

Arg Leu Asp Gly Ser lie Lys Gly Glu Leu Arg Lys Gin Ala Leu Asp 

755 760 765 

His Phe Asn Ala Glu Gly Ser Glu Asp Phe Cys Phe Leu Leu Ser Thr 

770 775 780 

Arg Ala Gly Gly Leu Gly lie Asn Leu Ala Ser Ala Asp Thr Val Val 
785 790 795 800 

lie Phe Asp Ser Asp Trp Asn Pro Gin Asn Asp Leu Gin Ala Gin Ala 

805 810 815 



Arg Ala His Arg lie Gly Gin Lys Lys Gin Val Asn lie Tyr Arg Leu 

820 825 830 

Val Thr Lys Gly Ser Val Glu Glu Asp lie Leu Glu Arg Ala Lys Lys 

835 840 845 

Lys Met Val Leu Asp His Leu Val lie Gin Arg Met Asp Thr Thr Gly 

850 855 860 

Lys Thr Val Leu His Thr Gly Ser Thr Pro Ser Ser Ser Thr Pro Phe 
865 870 875 880 

Asn Lys Glu Glu Leu Ser Ala lie Leu Lys Phe Gly Ala Glu Glu Leu 

885 890 895 

Phe Lys Glu Pro Glu Gly Glu Glu Gin Glu Pro Gin Glu Met Asp lie 

900 905 910 

Asp Glu lie Leu Lys Arg Ala Glu Thr Arg Glu Asn Glu Pro Gly Pro 

915 920 925 

Leu Thr Val Gly Asp Glu Leu Leu Ser Gin Phe Lys Val Ala Asn Phe 

930 935 940 

Ser Asn Met Asp Glu Asp Asp lie Glu Leu Glu Pro Glu Arg Asn Ser 
945 950 955 960 

Arg Asn Trp Glu Glu lie lie Pro Glu Ser Gin Arg Arg Arg lie Glu 

965 970 975 

Glu Glu Glu Arg Gin Lys Glu Leu Glu Glu lie Tyr Met Leu Pro Arg 

980 985 990 

Met Arg Asn Cys Ala Lys Gin lie Ser Phe Asn Gly Ser Glu Gly Arg 

995 1000 1005 

Arg Ser Arg Ser Arg Arg Tyr Ser Gly Ser Asp Ser Asp Ser lie Thr 
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1010 1015 1020 

Glu Arg Lys Arg Pro Lys Lys Arg Gly Arg Pro Arg Thr lie Pro Arg 
1025 1030 1035 1040 

Glu Asn lie Lys Gly Phe Ser Asp Ala Glu lie Arg Arg Phe lie Lys 

1045 1050 1055 

Ser Tyr Lys Lys Phe Gly Gly Pro Leu Glu Arg Leu Asp Ala Val Ala 

1060 1065 1070 

Arg Asp Ala Glu Leu Val Asp Lys Ser Glu Thr Asp Leu Arg Arg Leu 

1075 1080 1085 

Gly Glu Leu Val His Asn Gly Cys lie Lys Ala Leu Lys Asp Asn Ser 

1090 1095 1100 

Ser Gly Gin Glu Arg Ala Gly Gly Arg Leu Gly Lys Val Lys Gly Pro 
1105 1110 1115 1120 

Thr Phe Arg lie Ser Gly Val Gin Val Asn Ala Lys Leu Val lie Ser 

1125 1130 1135 

His Glu Glu Glu Leu Ala Pro Leu His Lys Ser lie Pro Ser Asp Pro 

1140 1145 1150 

Glu Glu Arg Lys Arg Tyr Val lie Pro Cys His Thr Lys Ala Ala His 

1155 1160 1165 

Phe Asp lie Asp Trp Gly Lys Glu Asp Asp Ser Asn Leu Leu Val Gly 

1170 1175 1180 

lie Tyr Glu Tyr Gly Tyr Gly Ser Trp Glu Met lie Lys Met Asp Pro 
1185 1190 1195 1200 

Asp Leu Ser Leu Thr Gin Lys lie Leu Pro Asp Asp Pro Asp Lys Lys 

1205 1210 1215 



Pro Gin Ala Lys Gin Leu Gin Thr Arg Ala Asp Tyr Leu lie Lys Leu 

1220 1225 1230 

Leu Asn Lys Asp Leu Ala Arg Lys Glu Ala Gin Arg Leu Ala Gly Ala 

1235 1240 1245 

Gly Asn Ser Lys Arg Arg Lys Thr Arg Asn Lys Lys Asn Lys Met Lys 

1250 1255 1260 

Ala Ser Lys lie Lys Glu Glu lie Lys Ser Asp Ser Ser Pro Gin Pro 
1265 1270 1275 1280 

Ser Glu Lys Ser Asp Glu Asp Asp Glu Glu Glu Asp Asn Lys Val Asn 

1285 1290 1295 

Glu Met Lys Ser Glu Asn Lys Glu Lys Ser Lys Lys lie Pro Leu Leu 

1300 1305 1310 

Asp Thr Pro Val His lie Thr Ala Thr Ser Glu Pro Val Pro lie Ser 

1315 1320 1325 

Glu Glu Ser Glu Glu Leu His Gin Lys Thr Phe Ser Val Cys Lys Glu 

1330 1335 1340 

Arg Met Arg Pro Val Lys Ala Ala Leu Lys Gin Leu Asp Arg Pro Glu 
1345 1350 1355 1360 

Lys Gly Leu Ser Glu Arg Glu Gin Leu Glu His Thr Arg Gin Cys Leu 

1365 1370 1375 

lie Lys lie Gly Asp His lie Thr Glu Cys Leu Lys Glu Tyr Thr Asn 

1380 1385 1390 

Pro Glu Gin lie Lys Gin Trp Arg Lys Asn Leu Trp lie Phe Val Ser 

1395 1400 1405 

Lys Phe Thr Glu Phe Asp Ala Arg Lys Leu His Lys Leu Tyr Lys His 
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1410 1415 
Ala lie Lys Lys Arg Gin Glu Ser Gin Gin 
1425 1430 



1420 



(2) INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1467 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/ KEY: Peptide 

(B) LOCATION: 1. .1467 

(D) OTHER INFORMATION: /note= "The sequence beginning at 1 

corresponds to 2654 and that ending at 1467 corresponds to 

4120" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Met Ala Ala Lys Asp lie Ser Thr Glu Val Leu Gin Asn Pro Glu Leu 

15 10 15 

Tyr Gly Leu Arg Arg Ser His Arg Ala Ala Ala His Gin Gin Asn Tyr 

20 25 30 

Phe Asn Asp Ser Asp Asp Glu Asp Asp Glu Asp Asn lie Lys Gin Ser 

35 40 45 

Arg Arg Lys Arg Met Thr Thr lie Glu Asp Asp Glu Asp Glu Phe Glu 
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50 55 60 

Asp Glu Glu Gly Glu Glu Asp Ser Gly Glu Asp Glu Asp Glu Glu Asp 
65 70 75 80 

Phe Glu Glu Asp Asp Asp Tyr Tyr Gly Ser Pro lie Lys Gin Asn Arg 

85 90 95 

Ser Lys Pro Lys Ser Arg Thr Lys Ser Lys Ser Lys Ser Lys Pro Lys 

100 105 110 

Ser Gin Ser Glu Lys Gin Ser Thr Val Lys lie Pro Thr Arg Phe Ser 

115 120 125 

Asn Arg Gin Asn Lys Thr Val Asn Tyr Asn lie Asp Tyr Ser Asp Asp 

130 135 140 

Asp Leu Leu Glu Ser Glu Asp Asp Tyr Gly Ser Glu Glu Ala Leu Ser 
145 150 155 160 

Glu Glu Asn Val His Glu Ala Ser Ala Asn Pro Gin Pro Glu Asp Phe 

165 170 175 

His Gly lie Asp lie Val lie Asn His Arg Leu Lys Thr Ser Leu Glu 

180 185 190 

Glu Gly Lys Val Leu Glu Lys Thr Val Pro Asp Leu Asn Asn Cys Lys 

195 200 205 

Glu Asn Tyr Glu Phe Leu lie Lys Trp Thr Asp Glu Ser His Leu His 

210 215 220 

Asn Thr Trp Glu Thr Tyr Glu Ser lie Gly Gin Val Arg Gly Leu Lys 
225 230 235 240 

Arg Leu Asp Asn Tyr Cys Lys Gin Phe lie lie Glu Asp Gin Gin Val 

245 250 255 
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Arg Leu Asp Pro Tyr Val Thr Ala Glu Asp lie Glu lie Met Asp Met 

260 265 270 

Glu Arg Glu Arg Arg Leu Asp Glu Phe Glu Glu Phe His Val Pro Glu 

275 280 285 

Arg lie lie Asp Ser Gin Arg Ala Ser Leu Glu Asp Gly Thr Ser Gin 

290 295 300 

Leu Gin Tyr Leu Val Lys Trp Arg Arg Leu Asn Tyr Asp Glu Ala Thr 
305 310 315 320 

Trp Glu Asn Ala Thr Asp lie Val Lys Leu Ala Pro Glu Gin Val Lys 

325 330 335 

His Phe Gin Asn Arg Glu Asn Ser Lys lie Leu Pro Gin Tyr Ser Ser 

340 345 350 

Asn Tyr Thr Ser Gin Arg Pro Arg Phe Glu Lys Leu Ser Val Gin Pro 

355 360 365 

Pro Phe lie Lys Gly Gly Glu Leu Arg Asp Phe Gin Leu Thr Gly lie 

370 375 380 

Asn Trp Met Ala Phe Leu Trp Ser Lys Gly Asp Asn Gly lie Leu Ala 
385 390 395 400 

Asp Glu Met Gly Leu Gly Lys Thr Val Gin Thr Val Ala Phe lie Ser 

405 410 415 

Trp Leu lie Phe Ala Arg Arg Gin Asn Gly Pro His lie lie Val Val 

420 425 430 

Pro Leu Ser Thr Met Pro Ala Trp Leu Asp Thr Phe Glu Lys Trp Ala 

435 440 445 

Pro Asp Leu Asn Cys lie Cys Tyr Met Gly Asn Gin Lys Ser Arg Asp 
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450 

Thr lie Arg Glu 
465 

Lys Thr Met Lys 

Lys Asp Arg Ala 
500 

Asp Glu Ala His 
515 

Leu Asn Ser Phe 
530 

Leu Gin Asn Asn 
545 

Pro Gly Arg Phe 

Glu Glu Gin Glu 
580 

Phe lie Leu Arg 
595 

Lys Thr Glu Arg 
610 

Tyr Tyr Lys Asn 
625 

Ala Lys Gly Gly 



455 

Tyr Glu Phe Tyr 
470 

Phe Asn Val Leu 
485 

Glu Leu Gly Ser 

Arg Leu Lys Asn 
520 

Lys Val Ala Asn 
535 

lie Lys Glu Leu 
550 

Thr lie Asp Gin 
565 

Glu Tyr lie His 

Arg Leu Lys Lys 
600 

lie Leu Arg Val 
615 

lie Leu Thr Lys 
630 

His Phe Ser Leu 
645 



460 

Thr Asn Pro Arg 
475 

Leu Thr Thr Tyr 
490 

lie Lys Trp Gin 
505 

Ala Glu Ser Ser 

Arg Met Leu lie 
540 

Ala Ala Leu Val 

555 

Glu lie Asp Phe 
570 

Asp Leu His Arg 
585 

Asp Val Glu Lys 

Glu Leu Ser Asp 
620 

Asn Tyr Ser Ala 
635 

Leu Asn lie Met 
650 



Ala Lys Gly Lys 
480 

Glu Tyr lie Leu 
495 

Phe Met Ala Val 
510 

Leu Tyr Glu Ser 
525 

Thr Gly Thr Pro 

Asn Phe Leu Met 
560 

Glu Asn Gin Asp 
575 

Arg lie Gin Pro 
590 

Ser Leu Pro Ser 
605 

Val Gin Thr Glu 

Leu Thr Ala Gly 
640 

Asn Glu Leu Lys 

655 
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Lys Ala Ser Asn His Pro Tyr Leu Phe Asp Asn Ala Glu Glu Arg Val 

660 665 670 

Leu Gin Lys Phe Gly Asp Gly Lys Met Thr Arg Glu Asn Val Leu Arg 

675 680 685 

Gly Leu lie Met Ser Ser Gly Lys Met Val Leu Leu Asp Gin Leu Leu 

690 695 700 

Thr Arg Leu Lys Lys Asp Gly His Arg Val Leu lie Phe Ser Gin Met 
705 710 715 720 

Val Arg Met Leu Asp lie Leu Gly Asp Tyr Leu Ser lie Lys Gly lie 

725 730 735 

Asn Phe Gin Arg Leu Asp Gly Thr Val Pro Ser Ala Gin Arg Arg lie 

740 745 750 

Ser lie Asp His Phe Asn Ser Pro Asp Ser Asn Asp Phe Val Phe Leu 

755 760 765 

Leu Ser Thr Arg Ala Gly Gly Leu Gly lie Asn Leu Met Thr Ala Asp 

770 775 780 

Thr Val Val lie Phe Asp Ser Asp Trp Asn Pro Gin Ala Asp Leu Gin 
785 790 795 800 

Ala Met Ala Arg Ala His Arg lie Gly Gin Lys Asn His Val Met Val 

805 810 815 

Tyr Arg Leu Val Ser Lys Asp Thr Val Glu Glu Glu Val Leu Glu Arg 

820 825 830 

Ala Arg Lys Lys Met lie Leu Glu Tyr Ala lie lie Ser Leu Gly Val 

835 840 845 

Thr Asp Gly Asn Lys Tyr Thr Lys Lys Asn Glu Pro Asn Ala Gly Glu 
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850 855 860 

Leu Ser Ala lie Leu Lys Phe Gly Ala Gly Asn Met Phe Thr Ala Thr 
865 870 875 880 

Asp Asn Gin Lys Lys Leu Glu Asp Leu Asn Leu Asp Asp Val Leu Asn 

885 890 895 

His Ala Glu Asp His Val Thr Thr Pro Asp Leu Gly Glu Ser His Leu 

900 905 910 

Gly Gly Glu Glu Phe Leu Lys Gin Phe Glu Val Thr Asp Tyr Lys Ala 

915 920 925 

Asp lie Asp Trp Asp Asp lie lie Pro Glu Glu Glu Leu Lys Lys Leu 

930 935 940 

Gin Asp Glu Glu Gin Lys Arg Lys Asp Glu Glu Tyr Val Lys Glu Gin 
945 950 955 960 

Leu Glu Met Met Asn Arg Arg Asp Asn Ala Leu Lys Lys lie Lys Asn 

965 970 975 

Ser Val Asn Gly Asp Gly Thr Ala Ala Asn Ser Asp Ser Asp Asp Asp 

980 985 990 

Ser Thr Ser Arg Ser Ser Arg Arg Arg Ala Arg Ala Asn Asp Met Asp 

995 1000 1005 

Ser lie Gly Glu Ser Glu Val Arg Ala Leu Tyr Lys Ala lie Leu Lys 

1010 1015 1020 

Phe Gly Asn Leu Lys Glu lie Leu Asp Glu Leu lie Ala Asp Gly Thr 
1025 1030 1035 1040 

Leu Pro Val Lys Ser Phe Glu Lys Tyr Gly Glu Thr Tyr Asp Glu Met 

1045 1050 1055 
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Met Glu Ala Ala Lys Asp Cys Val His Glu Glu Glu Lys Asn Arg Lys 

1060 1065 1070 

Glu lie Leu Glu Lys Leu Glu Lys His Ala Thr Ala Tyr Arg Ala Lys 

1075 1080 1085 

Leu Lys Ser Gly Glu lie Lys Ala Glu Asn Gin Pro Lys Asp Asn Pro 

1090 1095 1100 

Leu Thr Arg Leu Ser Leu Lys Lys Arg Glu Lys Lys Ala Val Leu Phe 
1105 1110 1115 1120 

Asn Phe Lys Gly Val Lys Ser Leu Asn Ala Glu Ser Leu Leu Ser Arg 

1125 1130 1135 

Val Glu Asp Leu Lys Tyr Leu Lys Asn Leu lie Asn Ser Asn Tyr Lys 

1140 1145 1150 

Asp Asp Pro Leu Lys Phe Ser Leu Gly Asn Asn Thr Pro Lys Pro Val 

1155 1160 1165 

Gin Asn Trp Ser Ser Asn Trp Thr Lys Glu Glu Asp Glu Lys Leu Leu 

1170 1175 1180 

lie Gly Val Phe Lys Tyr Gly Tyr Gly Ser Trp Thr Gin lie Arg Asp 
1185 1190 1195 1200 

Asp Pro Phe Leu Gly lie Thr Asp Lys lie Phe Leu Asn Glu Val His 

1205 1210 1215 

Asn Pro Val Ala Lys Lys Ser Ala Ser Ser Ser Asp Thr Thr Pro Thr 

1220 1225 1230 

Pro Ser Lys Lys Gly Lys Gly lie Thr Gly Ser Ser Lys Lys Val Pro 

1235 1240 1245 

Gly Ala lie His Leu Gly Arg Arg Val Asp Tyr Leu Leu Ser Phe Leu 
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1250 1255 1260 

Arg Gly Gly Leu Asn Thr Lys Ser Pro Ser Ala Asp lie Gly Ser Lys 
1265 1270 1275 1280 

Lys Leu Pro Thr Gly Pro Ser Lys Lys Arg Gin Arg Lys Pro Ala Asn 

1285 1290 1295 

His Ser Lys Ser Met Thr Pro Glu lie Thr Ser Ser Glu Pro Ala Asn 

1300 1305 1310 

Gly Pro Pro Ser Lys Arg Met Lys Ala Leu Pro Lys Gly Pro Ala Ala 

1315 1320 1325 

Leu lie Asn Asn Thr Arg Leu Ser Pro Asn Ser Pro Thr Pro Pro Leu 

1330 1335 1340 

Lys Ser Lys Val Ser Arg Asp Asn Gly Thr Arg Gin Ser Ser Asn Pro 
1345 1350 1355 1360 

Ser Ser Gly Ser Ala His Glu Lys Glu Tyr Asp Ser Met Asp Glu Glu 

1365 1370 1375 

Asp Cys Arg His Thr Met Ser Ala lie Arg Thr Ser Leu Lys Arg Leu 

1380 1385 1390 

Arg Arg Gly Gly Lys Ser Leu Asp Arg Lys Glu Trp Ala Lys lie Leu 

1395 1400 1405 

Lys Thr Glu Leu Thr Thr lie Gly Asn His lie Glu Ser Gin Lys Gly 

1410 1415 1420 

Ser Ser Arg Lys Ala Ser Pro Glu Lys Tyr Arg Lys His Leu Trp Ser 
1425 1430 1435 1440 

Tyr Ser Ala Asn Phe Trp Pro Ala Asp Val Lys Ser Thr Lys Leu Met 

1445 1450 1455 
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Ala Met Tyr Asp Lys lie Thr Glu Ser Gin 
1460 1465 



Lys 



INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Ala Val Glu Ala Asp Gly Asp Pro Asn Ala Gly Phe Glu Lys Ser Lys 

15 10 15 

Glu Leu Gly Glu lie Gin Tyr Leu lie Lys Trp Lys Gly Trp Ser His 

20 25 30 

lie His Asn Thr Trp Glu Thr Glu Glu Thr Leu Lys Gin Gin Asn Val 

35 40 45 

Lys Gly Met Asn Lys Leu Asp Asn Tyr Lys Lys 
50 55 



INFORMATION FOR SEQ ID NO: 23: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Ala Val Glu Ala Asp Gly Asp Pro Asn Ala Gly Phe Glu Arg Asn Lys 

15 10 15 

Glu Pro Gly Asp lie Gin Tyr Leu lie Lys Trp Lys Gly Trp Ser His 

20 25 30 

lie His Asn Thr Trp Glu Thr Glu Glu Thr Leu Lys Gin Gin Asn Val 

35 40 45 

Arg Gly Asn Lys Lys Leu Asp Asn Tyr Lys Lys 
50 55 



INFORMATION FOR SEQ ID NO: 24: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Glu Gly Lys Val Leu Glu Lys Thr Val Pro Asp Leu Asn Asn Cys Lys 

15 10 15 

Glu Asn Tyr Glu Phe Leu lie Lys Trp Thr Asp Glu Ser His Leu His 

20 25 30 

Asn Thr Trp Glu Thr Tyr Glu Ser lie Gly Gin Val Arg Gly Leu Lys 
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35 



40 



45 



Arg Leu Asp Asn Tyr Cys Lys 



50 



55 



INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Glu Glu Glu Glu Glu Tyr Ala Val Glu Lys lie lie Asp Arg Arg Val 

15 10 15 

Arg Lys Gly Lys Val Glu Tyr Tyr Leu Lys Trp Lys Gly Tyr Pro Glu 



Thr Glu Asn Thr Trp Glu Pro Glu Asn Asn Leu Asp Cys Gin Asp Leu 



20 



25 



30 



35 



40 



45 



He 



Gin Gin Tyr 



50 



INFORMATION FOR SEQ ID NO: 26: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 



- 109 - 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Glu Asp Glu Glu Glu Tyr Val Val Glu Lys Val Leu Asp Arg Arg Val 

15 10 15 

Val Lys Gly Lys Gin Val Glu Tyr Leu Leu Lys Trp Lys Gly Phe Ser 

20 25 30 

Glu Glu His Asn Thr Trp Glu Pro Glu Lys Asn Leu Asp Cys Pro Glu 

35 40 45 

Leu lie Ser Glu Phe 
50 

INFORMATION FOR SEQ ID NO: 27: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



Glu Glu Glu Glu Glu Tyr Val Val Glu Lys Val Leu Asp Arg Arg Val 
15 10 15 
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Val Lys Gly Lys Val Glu Tyr Leu Leu Lys Trp Lys Gly Phe Ser Asp 

20 25 30 

Glu Asp Asn Thr Trp Glu Pro Glu Glu Asn Leu Asp Cys Pro Asp Leu 

35 40 45 

He Ala Glu Phe 
50 

(2) INFORMATION FOR SEQ ID NO: 28: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Ala Glu Pro Glu Glu Phe Val Val Glu Lys Val Leu Asp Arg Arg Val 

15 10 15 

Val Asn Gly Lys Val Glu Tyr Phe Leu Lys Trp Lys Gly Phe Thr Asp 

20 25 30 

Ala Asp Asn Thr Trp Glu Pro Glu Glu Asn Leu Asp Cys Pro Glu Leu 

35 40 45 

He Glu Asp Phe 
50 
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(2) INFORMATION FOR SEQ ID NO: 29: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 



Pro Val Asp Leu Val Tyr Ala Ala 
1 5 
Lys Lys Gly Val Val Glu Tyr Arg 
20 

Arg Tyr Asn Thr Trp Glu Pro Glu 

35 40 
lie Asp lie Tyr 
50 



Glu Lys lie lie Gin Lys Arg Val 

10 15 
Val Lys Trp Lys Gly Trp Asn Gin 
25 30 
Asn Asn lie Leu Asp Arg Arg Leu 

45 



(2) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Val Gly Glu Gin Val Phe Ala Ala Glu Cys lie Leu Ser Lys Arg Leu 

15 10 15 

Arg Lys Gly Lys Leu Glu Tyr Leu Val Lys Trp Arg Gly Trp Ser Ser 

20 25 30 

Lys His Asn Ser Trp Glu Pro Glu Glu Asn lie Leu Asp Pro Arg Leu 

35 40 45 

Leu Leu Ala Phe 
50 



(2) INFORMATION FOR SEQ ID NO: 31: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
AGATATTCTG GATCTGATAG TGATTCAATC TCGGAAAGGA AACGGCCGAA GAAACGTGGG 60 
CGACCCCGCA CTATCCCTCG GGAGAATATT AAAGGATTTA GTGATGCGGA G 111 



(2) INFORMATION FOR SEQ ID NO: 32: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 base pairs 

(B) TYPE: nucleic acid 



- 113 - 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: CDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
AGATATTCTG GATCTGATAG TGACTCCATC ACAGAAAGAA AACGGCCAAA AAAGCGTGGA 60 
AGACCTCGAA CCATTCCTCG AGAAAATATT AAAGGATTTA GTGATGCAGA G 111 



(2) INFORMATION FOR SEQ ID NO: 33: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CTCCATCTCG GAAAGGAAAC GGCCAAAAAA GCGTGGAAGA CCACGAACTA TTCCTCGAGA 60 
AAATATA 67 

(2) INFORMATION FOR SEQ ID NO: 34: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 



r 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
AGATATTCTG GATCTGATAG TGATTCCATC TCAGAAAGAA AACGACCAAA AAAACGTGGA 60 
CGACCACGAA CTATTCCCCG TGAAAACATT AAAGGATTTA GTGATGCAGA G 111 



(2) INFORMATION FOR SEQ ID NO: 35: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
CTCCATCTCA GAAAGAAAAC GACCAAGAAA ACGTGGACGA CCACGAACTA TTCCTCGTGA 60 
AAATATT 67 



(2) INFORMATION FOR SEQ ID NO: 36: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
GATCTGATAG TGACTCCATC TCAGAAAGAA AACGACCAAG AAAACGTGGA CGACCACGAA 60 
CTATCCCTCG GGAGAATATT AAAGGATTTA GCGATGCAGA GATTAGGCGG T 111 
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(2) INFORMATION FOR SEQ ID NO: 37: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic DNA Primer" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
ATATTCTGGA TCTGATAGTG AYTC 24 



(2) INFORMATION FOR SEQ ID NO: 38: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic DNA Primer" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
AGATATTCCG GATCTGATAG TGA 23 



(2) INFORMATION FOR SEQ ID NO: 39: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic DNA Primer" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
TTTCCTAAAT CGCTACGTCT 2 0 



j 



